Update manager for database system

ABSTRACT

A database management protocol exchanges update tokens between a client and a server on which the database resides. When a client requests data to be read from a database, an update manager either reads an update token stored therein or generates one dynamically. The update token represents a current state of the data object being read. Sometime thereafter, if the client requests new data to be stored in the object, the client may furnish the update token back to the update manager. The update manager compares the client&#39;s update token to a local update token representing a current state of the database and, if they match, determines that the state has not changed. If they do not match, an error results.

BACKGROUND

Embodiments of the present invention relate to a management system for databases and, in particular, for managing multiple concurrent and possibly inconsistent requests from multiple users to change common data records.

Modern businesses use computers across almost all business functions. Computer systems model business transactions and automatically create and update data records in databases. A bank, for example, uses a computer system to open commercial loans. Database(s) in the bank's computer system serves as a repository for information on the bank's customers, terms and conditions of loans extended to those customers, customer payment history, etc. Bank employees typically must interact with the computer system to open new loans before they are approved and money is extended to a customer. As the computer system advances through its operations, it creates and updates several data objects. This is but one example; computer systems develop data records as they hire and fire employees, issue purchase orders, provide quotes to customers, arrange for product shipments and design products, among others. Many firms' computer systems provide enterprise management functions, which represent an integration of a several business and financial applications and, of course, underlying data sets.

In such systems, computer applications often field requests from a variety of computer users, which involve requests to read, update and store data in databases. The various computer users may operate independently, unaware of the activity of other users. At times, multiple users may issue concurrent requests directed to a common data object within a database. If the concurrent requests merely require data to be read from a database, typically no adverse consequences arise. If the requests, however, require data in the database to be changed, performance issues can be implicated.

Consider a simple example where two users both read and update customer data. Both users have local copies of customer address data on their computers. A first user enters a change of address representing the customer's relocation from one city to another. The data is stored in the database. Afterward, the second user notices a typographical error in the now-stale customer address (say, the city name) and corrects it. Hypothetically, the second user may enter a command that causes only the city field to be stored in the database. If the second user's command were permitted to proceed, a data inconsistency may occur because the street, state and ZIP CODE fields in the customer record may contain data as the first user had specified it but the city field will contain obsolete data.

Various database management protocols are known but they typically require control over the design of the database itself. Such protocols are inappropriate for many modern computer systems which are assembled from a variety of heterogeneous applications and databases. Accordingly, there is a need in the art for a database management protocol that is non-invasive—it works equally as well with databases that have native update controls and those that do not.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a simplified diagram of a computer system according to an embodiment of the present invention.

FIG. 2 illustrates a method according to an embodiment of the present invention.

FIG. 3 illustrates another method according to an embodiment of the present invention.

FIG. 4 is a data flow diagram according to an embodiment of the present invention.

DETAILED DESCRIPTION

Embodiments of the present invention provide a database management protocol that exchanges update tokens between a client and a server on which the database resides. When a client requests data to be read from a database, an update manager either reads an update token stored therein or generates one dynamically. The update token represents a current state of the data object being read. Sometime thereafter, if the client requests new data to be stored in the object, the client may furnish the update token back to the update manager. The update manager compares the client's update token to a local update token representing a current state of the database and, if they match, determines that the state has not changed. If they do not match, an error results.

FIG. 1 illustrates a computer system suitable for use with the present invention. The computer system 100 is shown as including one or more terminals 110 and servers 120 interconnected by a network 130. The system 100 may include various types of terminals 110 such as desktop and portable computer platforms, which may execute program applications as appropriate to satisfy the needs of a proprietor of the computer network. The sever(s) 120 may execute application programs in a centralized or distributed execution environment as appropriate to the proprietor's needs. Various network 130 topologies are known such as local area networks, wide area networks, virtual private networks and the like. Unless otherwise noted herein, variations among types of terminals 110, types of servers 120 and types of networks 130 are immaterial to the present discussion. For purposes of the present discussion, it is sufficient to note that operators at the terminals 110 may access and update data stored by databases within the servers 120.

As noted the server(s) 120 may execute application programs in a centralized or distributed manner. FIG. 1 illustrates several exemplary applications 130, which may include an execution engine 132 and supporting database 134. In highly complex execution environments, when several different applications contribute to an integrated execution environment, the applications 130 may be considered “backend” applications, each dedicated to a its own feature set within the integrated system. Such systems also may include a system “front end” application 140, which fields requests from operators and coordinates among the backend applications to perform the requests. In this regard, the operation of complex network systems is well known.

Embodiments of the present invention introduce an update manager 136 for use in a computer system. Particularly in systems with a large number of users, multiple users may access common data records simultaneously. Often, the users merely read stored data records for use in their assigned tasks. Users also may read data records and update them. Sometimes, multiple users may read and attempt to update the same data records simultaneously. Left unchecked, such attempts can introduce data inconsistencies which can create performance impairments for the system 100. The update manager 136 manages user attempts to store new data in data records to prevent such inconsistencies.

In the embodiment illustrated in FIG. 1, the update managers 136 are shown as components within application engines 132 and replicated within each backend application 130 of the system. In other embodiments (not shown), the system may employ a central update manager 136 which is inter-operable with each of the applications 130 for which an update control mechanism is desired.

To implement the update control policies, when a client requests a read of data from the database, the server 120 generates an “update token” and sends it to the client along with the requested data being read. The update token identifies the data of the data object from which data has been read. When a client 110 requests a write of data to the database (a database update), the client 110 returns the update token that it received during the read back to the server 120. The update manager 136 compares the update token received from the client 110 to a local copy of the update token. If they match, the server 120 may conclude that the target database object is unchanged from the time the client 110 read the record data to the present time. The requested write operation may be performed. If the update tokens do not match, the database has been changed. An error condition exists.

Several update control schemes are disclosed herein. In a first embodiment, the database 134 stores express version control information in association with the data records over which the update manager 136 has authority. The version control information may be used as an update token. This embodiment requires the database 134 to store version control information in the database as administrative data, which reduces the amount of database resources that are available for other uses. In another embodiment, the update manager 136 may generate an update token dynamically from substantive information contained in the data records. In a further embodiment, the update manager leverages change identifiers resident as substantive data within database objects. Each of these embodiments is described in further detail below.

FIG. 2 illustrates a method 200 according to an embodiment of the present invention, which may be invoked by an application to respond to a read request from a terminal. The read request identifies a data object being requested, typically by an object ID. In response, the application retrieves the requested data from its database and furnishes the data along with an update identifier (box 210). The method may determine whether the object possesses an update identifier (box 220). If not, the method 200 may calculate an update token based upon a hash function calculated with reference to data of the object (box 230). If so or upon conclusion of the token calculation, the method 200 may transmit the requested data to the client along with the update token.

The embodiment of FIG. 2 may manage updates to a group of databases which may have very different support for update management. As is known, some database systems may assign timestamps to data objects representing the time when data was most recently written to the object. Other database systems may include version counters that represent the number of times data has been written to a data object. For such database systems, the timestamps or version counters may be taken directly as update tokens and passed to clients.

Other database systems, however, may not include native support for update management. Update tokens may be derived from other sources in this instance. In some instances, for example, an application engine 132 may generate new documents whenever substantive information in the object is changed. Such procedures are common in financial applications, for example, when it is necessary not only to store complete copies of documents but also to identify complete document histories reflecting changes thereto. When such policies are in effect, new documents are created on each document change. Each new document is assigned its own document identifier. In such embodiments, the document identifiers may be taken as update tokens.

In still other applications, database object may not store any data to indicate when/how data therein is changed. In such an embodiment, an update manager 136 may calculate an update token dynamically from substantive data of the data object. For example, the update manager 136 may calculate an update token by applying a hash function to object data retrieved from the database. Hash functions typically generate a unique code in response to a unique set of input data. If a database object is changed, the code output by a hash function should be different than the code that would be generated before the database object was changed. Thus, the hash function code can be used as an effective update token.

The method of FIG. 2 finds particular utility in installations where a computer system employs a wide variety of database storage systems, some of which might have native support for update management and others that do not. The method of FIG. 2 permits an update manager to determine whether a retrieved data object already stores an update token—either as administrative data or as substantive data of the corresponding application—and, if not, to generate an update token dynamically.

Even in situations where a particular database natively supports update controls by timestamps or update counters, it may be beneficial to dynamically create update tokens using the hash function. In some instances, data inconsistencies might be tolerated within a data object for certain fields but not for others. In such a case, an update manager may apply the hash function to those fields for which data inconsistencies cannot be tolerated, omitted other fields. If a first update is directed to field which is not included in the calculation of the hash value, then the hash value will not change. A second update may occur to the data object even if the update is based on object data that had been read from the object before the first update occurred. The first update does not cause an error because it does not change the local hash value. The local hash value may be equal to the hash value returned as part of the second update. Thus, the hash function provides for a more graceful implementation of update management policies than other possible approaches.

FIG. 3 illustrates a method 300 of confirming a database write request according to an embodiment of the present invention. To request a write of data to a database object, a client may provide the data to be written and a copy of an update token that it had been provided when reading data from the database object at some point earlier (called the “client update token” for purposes of the present discussion). In response to the write request, the method 300 may retrieve a copy of the addressed database object from the database (box 310). The method 300 may determine whether the object has an update token, called the “local update token” for purposes of the present discussion (box 320). If not, the method 300 may calculate the local update token using data resident in the retrieved object (box 330). Thereafter or if, at box 320, the object had a local update token, the method 300 may compare the local update token to the client update token (box 340). If they match, the method 300 confirms that the requested write operation may proceed (box 350).

If the local update token does not match the client update token, it indicates that the contents of the database have been changed in the time period between the client's read of data from the object and the present time. If the requested write operation were permitted, it could cause data consistency problems. Accordingly, the method 300 may transmit a copy of the object and the local update token back to the client 300 or simply indicate an error. The method 300 may conclude.

If an error is detected at box 360, a user interface at the client side (not shown) may indicate that the requested write could not be completed and display new data of the object. The user interface may provide a prompt that displays current object data and permits the user to confirm that the write request be resubmitted. If such confirmation is received, the client may resubmit the write request with the new update token. The method 300 of FIG. 3 may be repeated and, if it completes without error, the write request may be performed at box 350.

FIG. 4 is a dataflow diagram 400 illustrating exemplary communication flow among three elements of a computer network: two client terminals 410A, 410B and a server 420. For the purposes of the present discussion, it may be assumed that the server 120 executes an application having an update manager integrated therewith and stores application data in a database (components not shown in FIG. 4). The initial value of an update token may be assumed to be 1. As shown in FIG. 4, both clients 410A, 410B may issue read requests to a common data object (452, 456) and may be provided copies of the requested objects and the update token (454, 460). During the ordinary course of operation at each of the clients, the requested data may be updated. One of the client terminals (say, client 410B) may transmit a request to store its updated object data at the server (462). The client 410B furnishes its copy of the update token as well. An update manager within the server may compare the furnished copy of the update token with its local copy, determine that they match and, therefore, may permit the updated data to be stored (464). At this point the update token may be changed, either expressly or by implication due to the changes to the data object (represented by 466).

Thereafter, client 410A sends its store request to the server 120 with its now stale version of the update token. The server's update manager may compare the furnished copy of the update token with its local copy, determine that they do not match and generate an error condition (470). The server 420 may furnish a current copy of the data object and the updated update token to the requesting client 410A (472). Thereafter, an operator at the client terminal 410A may be prompted to confirm the initial write request (474). If the operator confirms the request, the store request may be resent to the server 420 along with the new update token (476). The server may compare the furnished copy of the update token with its local copy, determine that they match and store updated data to its database (478). At this point the update token may be changed again (480).

If at 474 an operator decides against confirming the initial store request, the communication flow may end.

As another advantage of the foregoing embodiments, note that the client's role in the communication flow is identical no matter what type of update token is used. The client merely caches an update token and returns it to the server when making a subsequent write request. The protocol remains the same for update tokens that are timestamps, update counters, hash codes and document identifiers. Thus, no changes need to be made for deployed client terminals in a network even if substantial upgrades are made to the databases underlying the network applications.

Several embodiments of the invention are specifically illustrated and/or described herein. However, it will be appreciated that modifications and variations of the invention are covered by the above teachings and within the purview of the appended claims without departing from the spirit and intended scope of the invention. 

1. A data consistency method for a data storage device, comprising: responsive to a request to store data on the device, retrieving a data object referenced by the request, generating an update token from the retrieved data, comparing the generated update token against another update token received in the request, if they match, storing data received in the request in the database.
 2. The method of claim 1, wherein the update token is generated by applying data of the data object to a hash function.
 3. The method of claim 1, wherein the update token is a timestamp stored in the database in association with the data object.
 4. The method of claim 1, wherein the update token is a count value representing a number of times the data object has been updated in the database, the count value stored in the database in association with the data object.
 5. The method of claim 1, wherein the update token is a document identifier.
 6. The method of claim 5, further comprising, if a match is detected, generating a new data object from the retrieved data object and data of the request and assigning a new document identifier thereto.
 7. The method of claim 1, further comprising, if a match does not occur, transmitting the retrieved data object and the generated update token to a sender of the request.
 8. The method of claim 1, further comprising, after the storing, updating the update token and storing it in the database.
 9. A database management method, comprising: in response to a request to read data from a database, reading an object referenced by the request, determining whether the object possesses an update token, if not, generating an update token by applying object data to a predetermined function, and sending the retrieved data object and the update token to a sender of the request.
 10. The method of claim 9, further comprising, in response to a subsequent write request received from the sender that addresses the object and contains the update token, retrieving the object from the database, generating a current update token from the retrieved data, comparing the current update token against the update token received in the request, and if they match, storing data received in the request in the database.
 11. The method of claim 10, further comprising, if the current update token does not match the update token received in the request, transmitting the retrieved data object and the current update token to a sender of the request.
 12. The method of claim 10, wherein the update tokens both are generated by applying a hash function to the data retrieved from the data object at each respective time.
 13. The method of claim 10, wherein the update tokens both are timestamps stored in the database in association with the data object at each respective time.
 14. The method of claim 10, wherein the update tokens both are count values representing a number of times the data object has been updated in the database, the count value stored in the database in association with the data object at each respective time.
 15. The method of claim 10, wherein the update tokens are document identifiers stored in the database in association with the data object at each respective time.
 16. A computer system, comprising one or more integrated servers executing: applications to be used asynchronously by a plurality of distributed operators, the applications including execution engines and associated data sets, and an update manager to govern update operations from disparate users to the data sets, the update manager adapted to: respond to a request to update a data set by retrieving a data object referenced by the request, generating an update token from the retrieved data, comparing the generated update token against another update token received in the request, if they match, storing data received in the request in the data set.
 17. The computer system, wherein the update manager further is adapted to: respond to a request to read data from the data set by reading an object referenced by the request, determine whether the object possesses an update token in the data set, if not, generate a read update token by applying object data to a predetermined function, and send the retrieved data object and the read update token to a sender of the request.
 18. The computer system of claim 16, further comprising a plurality of operator terminals in communication with the server(s). 